
Page 1 of 7 



^ OIPE 



RAW SEQUENCE LISTING DATE: 12/18/2001 

PATENT APPLICATION: US/09/848,852 TIME : 16:52:06 



Input Set : N:\Crf3\RULE60\09848852.raw 
Output Set: N:\CRF3\12182001\I84 8852.raw 



SEQUENCE LISTING 

3 (1) GENERAL INFORMATION: 

5 (i) APPLICANT: Hillman, Jennifer L. 

6 Tang, Y. Tom 

7 Corley, Neil C. 

8 Guegler, Karl J. 

9 Yue, Henry 

10 Patterson, Chandra 

12 (ii) TITLE OF INVENTION: PROTEINS ASSOCIATED WITH EGF SIGNALING 

14 (iii) NUMBER OF SEQUENCES: 5 

16 (iv) CORRESPONDENCE ADDRESS: 

17 (A) ADDRESSEE: Incyte Pharmaceuticals, Inc. 

18 (B) STREET: 3174 Porter Dr. 

19 (C) CITY: Palo Alto 

20 (D) STATE: CA 

21 (E) COUNTRY: USA 

22 (F) ZIP : 94304 

24 (V) COMPUTER READABLE FORM: 

25 (A) MEDIUM TYPE: Diskette 

26 (B) COMPUTER: IBM Compatible 

27 (C) OPERATING SYSTEM: DOS 

28 (D) SOFTWARE: FastSEQ for Windows Version 2.0 
30 (vi) CURRENT APPLICATION DATA: 

C--> 31 (A) APPLICATION NUMBER: US/09/848,852 

C--> 32 (B) FILING DATE: 04-May-2001 

34 (vii) PRIOR APPLICATION DATA: 

35 (A) APPLICATION NUMBER: 09/069,725 

36 (B) FILING DATE: 

40 (Viii) ATTORNEY/AGENT INFORMATION: 

41 (A) NAME: Billings, Lucy J. 

42 (B) REGISTRATION NUMBER: 36,749 

4 3 (C) REFERENCE/DOCKET NUMBER: PF-0515 US 

45 (ix) TELECOMMUNICATION INFORMATION: 

46 (A) TELEPHONE: 650-855-0555 

47 (B) TELEFAX: 650-845-4166 
50 (2) INFORMATION FOR SEQ ID NO : 1: 

52 (i) SEQUENCE CHARACTERISTICS: 

53 (A) LENGTH: 298 amino acids 

54 (B) TYPE: amino acid 

55 (C) STRANDEDNESS : single 

56 (D) TOPOLOGY: linear 

58 (vii) IMMEDIATE SOURCE: 

59 (A) LIBRARY: COLNNOT16 

60 (B) CLONE: 1281694 

62 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1: 

64 Met Asn Asp Ser Leu Arg Thr Asp Val Phe Val Arg Phe Gin Pro Glu 

65 1 5 10 15 



ENTERED 
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66 Ser lie Ala Cys Ala 

67 20 

68 Pro Leu Pro Asn Arg 

69 35 

70 Glu Glu lie Gin Glu 

71 50 

72 Lys Lys Val Asp Leu 

73 65 

74 His Ala lie Glu Glu 

75 85 

76 Gly Thr Gin Val Leu 

77 100 

78 Leu Val Glu Ser Pro 

79 115 

80 Ser Val Lys Asn Thr 

81 130 

82 Ala Asp Ser Pro Val 

83 145 

84 Arg Ser Arg Ser Arg 

85 165 

86 Ala Ser Pro Lys Arg 

87 180 

88 Ser Lys Ser Gin Ser 

89 195 

90 Gin Ala Pro Arg Ser 

91 210 215 220 

92 Arg Lys Ser Lys Asp Cys Lys Tyr Pro Gin Lys Pro His Lys Ser Arg 

93 225 230 235 240 

94 Ser Arg Ser Ser Ser Arg Ser Arg Ser Arg Ser Arg Glu Arg Ala Asp 

95 245 250 255 

96 Asn Pro Gly Lys Tyr Lys Lys Lys Ser His Tyr Tyr Arg Asp Gin Arg 

97 260 265 270 

98 Arg Glu Arg Ser Arg Ser Tyr Glu Arg Thr Gly Arg Arg Tyr Glu Arg 

99 275 280 285 

100 Asp His Pro Gly His Ser Arg His Arg Arg 

101 290 295 

103 (2) INFORMATION FOR SEQ ID NO: 2: 

105 (i) SEQUENCE CHARACTERISTICS: 

106 (A) LENGTH: 3464 base pairs 

107 (B) TYPE: nucleic acid 

108 (C) STRANDEDNESS : single 

109 (D) TOPOLOGY: linear 

111 (vii) IMMEDIATE SOURCE: 

112 (A) LIBRARY: COLNNOT16 

113 (B) CLONE: 1281694 

115 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2: 

117 TTCGTGAAGC ACTCCATGGA GCATGTGTCA ATGGCCTGTG TCCACCTGGC TTCCAAGATA 60 

118 GAAGAGGCCC CAAGACGCAT ACGGGACGTC ATCAATGTGT TTCACCGCCT TCGACAGCTG 120 

119 AGAGACAAAA AATAATCGTT ATGTACCTTC AGGTGTTAGA GTGTGAGCGT AACCAACACC 180 



Cys 


He 
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Leu 


Ala 


Ala 


Arg 


Thr 


Leu 


Glu 


He 
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30 






Pro 


His 


Trp 
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Leu 
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Phe 


Gly 


Ala 


Thr 
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45 








He 


Cys 


Leu 


Lys 


He 


Leu 


Gin 


Leu 


Tyr 


Ala 


Arg 
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60 










Thr 


His 


Leu 


Glu 


Gly 


Glu 


Val 


Glu 


Lys 


Arg 


Lys 


70 










75 










80 


Ala 


Lys 


Ala 


Gin 


Ala 


Arg 


Gly 


Leu 


Leu 


Pro 


Gly 
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Asp 


Gly 


Thr 


Ser 


Gly 


Phe 


Ser 


Pro 


Ala 


Pro 


Lys 
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110 






Lys 


Glu 


Gly 


Lys 


Gly 


Ser 


Lys 


Pro 


Ser 


Pro 


Leu 






120 










125 








Lys 


Arg 


Arg 


Leu 


Glu 


Gly Ala 


Lys 


Lys 


Ala 


Lys 
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140 
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Gly 
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Pro 


Lys 


Gly Arg 


Glu 


Ser 


Arg 


Ser 


150 










155 
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Glu 


Gin 


Ser 
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Ser 


Arg 


Ser 
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Ser 


Arg 


Ser 
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Arg 
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Ser 


Asp 


Ser 


Gly 


Ser 


Thr 


Ser 


Gly 


Gly 
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Arg 


Ser 


Arg 


Ser 


Arg 


Ser 


Asp 


Ser 


Pro 


Pro 


Arg 






200 










205 








Ala 


Pro 


Tyr 


Lys 


Gly 


Ser 


Glu 


He 


Arg 


Gly 


Ser 



file://C:\CRF3\Outhold\VsrI848852.htm 



12/18/01 




Page 3 of 7 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/84 8,852 



DATE: 12/18/2001 
TIME: 16:52:06 



Input Set : N:\Crf3\RULE60\09848852.raw 
Output Set: N:\CRF3\12182001\I848852.raw 



120 TGGTCCAGAC CTCATGGGTA GCCTCTGAGG GTAAGTGACT AAGACTTCTC CTCTGCTGTC 

121 CAAGCGCTTT GGTGCAGGGA CAGCGGCATC TTCAGCCAAT CCAGTGCAGG CTCTCCACCG 

122 AAGGCTGGCT CTAGACTGGT GACCCCTTGT TGAAATGGGA CAGTTGGCAG CGGCTCTGAT 

123 GAGCCCGAGA AGAGGCCTGC CCTTGGGTGC GGAGTCTCCC TCCGCACGAT GCTCCCACGC 

124 GTCCAACTTG CACCCAAGGG GCTTTTCCCT CTTCCAAGTG GACTCCTTCA AGGAAGCTGC 

125 AGCTCGGTCA GCAGAGAAGG GGCCTGCCGC CAGCGCCCTG GAGGAAGAGG AAGAGGAACC 

126 CAAGAGGATG GCTTGTCTCC CAGCAGCCAC ACCGGCTTTG TGCTCAGCCA GTTCATTTGA 

127 GTTTGCATGT TTCTCTGCAC TATGGATTTT GAGCATTTAG ATTTCTTTAA TCAAAAGCGT 
12 8 TTTAGTGACT CCAGTAGACA TTTTCTTTCT GAGGCATCGT GCTTTGCATG AGAGCAGGCC 

129 AAGGTTGAGG GGAAAAGTAA AGTTAAAGTC GGTTCTCTTT CATAGCAACA CGTATTGTCT 

130 GACATTCAGC CAGCTTTTTT TTTTTCTAAT AATTTCTGTG CCTTTCTGTC CTGTATTTAC 

131 TGTATTTAGA AAAAGCAGCT AGAATATTTC TCCATTAACT CTTGAGATTC ACAGGACTGT 

132 CTAGCTCTGA GTCCTAGCAA TAGACTCCTT AGAGGAGTAG TACGTTTATC TAGATTTTCT 

133 CTAGATAATG CAGGCGGAAG ACCTGGGTTC CCGGGTGGGG CATTGCAGTT CTTCCTGTGT 

134 TTGGCTTCCA GGAATTACAT GAACGACAGC CTTCGCACCG ACGTCTTCGT GCGGTTCCAG 

135 CCAGAGAGCA TCGCCTGTGC CTGCATTTAT CTTGCTGCCC GGACGCTGGA GATCCCTTTG 

136 CCCAATCGTC CCCATTGGTT TCTTTTGTTT GGAGCAACTG AAGAAGAAAT TCAGGAAATC 

137 TGCTTAAAGA TCTTGCAGCT TTATGCTCGG AAAAAGGTTG ATCTCACACA CCTGGAGGGT 

138 GAAGTGGAAA AAAGAAAGCA CGCTATCGAA GAGGCAAAGG CCCAAGCCCG GGGCCTGTTG 

139 CCTGGGGGCA CACAGGTGCT GGATGGTACC TCGGGGTTCT CTCCTGCCCC CAAGCTGGTG 

140 GAATCCCCCA AAGAAGGTAA AGGGAGCAAG CCTTCCCCAC TGTCTGTGAA GAACACCAAG 

141 AGGAGGCTGG AGGGCGCCAA GAAAGCCAAG GCGGACAGCC CCGTGAACGG CTTGCCAAAG 

142 GGGCGAGAGA GTCGGAGTCG GAGCCGGAGC CGTGAGCAGA GCTACTCGAG GTCCCCATCC 
14 3 CGATCAGCGT CTCCTAAGAG GAGGAAAAGT GACAGCGGCT CCACATCTGG TGGGTCCAAG 

144 TCGCAGAGCC GCTCCCGGAG CAGGAGTGAC TCCCCACCGA GACAGGCCCC CCGCAGCGCT 

145 CCCTACAAAG GCTCTGAGAT TCGGGGCTCC CGGAAGTCCA AGGACTGCAA GTACCCCCAG 

146 AAGCCACACA AGTCTCGGAG CCGGAGTTCT TCCCGTTCTC GAAGCAGGTC ACGGGAGCGG 
14 7 GCGGATAATC CGGGAAAATA CAAGAAGAAA AGTCATTACT ACAGAGATCA GCGACGAGAG 
14 8 CGCTCGAGGT CGTATGAACG CACAGGCCGT CGCTATGAGC GGGACCACCC TGGGCACAGC 

149 AGGCATCGGA GGTGAGGCGG GGTTGCAGTG ACTGGTGGCC GCAAGCCCTT CCCTGGGGAG 

150 TACCTGATGG CTGCCCTTTG ACCCCCGGTG GCTGCCCTTT GACCCCCGGG TGTGCTCTCA 

151 GCGCAAGTGG TCCTAGAACA GGATTCTTTT TGGAAATGTC TGTCGACTGG ACCTTGGTGG 

152 ATTTGGAAAT GGAACTGAGG GACCGGTGAC ACGTGCTTCA GACCGGTCTG GGGTGCGGCG 

153 CACACCTGGG CCCGTGCAGG GCTCAGCTCG GCAGCAGCTC TGAGGGCAGC TCAATGAAAA 

154 AGTGAATGCA CACGCCCTTG TTGGCGTGGC CTGGCATGGC CTGGTGCTAT CGGCAGCCGC 

155 TCTCCACTCC CCGACTGATA CTCAATTACG TGAAGCCAAG AAAGATGATT TTTAGAACCT 

156 TTGCCTATAT TAGGTTGTAC TTATGTACAT ATTTTGCAGT GTTTCACAGG AGAAAGTGGC 

157 CTTAACTGCC CCTTATTCTC TCTCCACGTT GTAAATAAAC ATGTGTTTAA TACAAGTTAA 

158 AGCTATGTAT GAAAACTCAG AACTTGAATC CCGTCAGCTT AAAACTTGTG TAGGGAATCC 

159 TGACTTTTAA AATGTGAGGG TATTTGGATC TGTGTTGAAA GTCGTATATT TTTATCTGTG 

160 CGGTGCTGAG TGCAGGCCAC CAGCTCCTAA ATAGAGGTTC CCTATATGCG CGTATGACAT 

161 GGTGAATAAA CACAACTCTC TCCACTCAGG ACATCCGGAG CGTTATGGAC GTGGTAGGTG 

162 GTCGTTCTGT GTGCTTGTGA AAGTGTCCAG GCGTGTGCAC AGCCAGTGCG CCCACTTCCG 

163 GGCTCCTTGC TCCCTGCTGT ACTGAAGTTT TGGATTTTGC ATCCAATCCT GTGTGCCTGC 

164 CCTTCTGCCG AAGCTTGTGA GGGGCCTGAG TCCTCTGCCC ATCAGGATGA CAGGCTCCTT 

165 CCTGCAGGGC CATAGGAGGG AAGTTTTGGA AACACAGAAT GATTCCAAGG TGCTCTCGTT 

166 CCTGAGGGGG ACTGGTTTGT AACCCATGAC ATCTGTGGGC GAGAGAGGCA GCTGGGAGCA 

167 GGACACTTGG AGGGTCACCC CACGGGGGTG GCACCTGCAC TCTGAGTGCC CCCCACTGTC 

168 ATCAGCTGCC TCTTACCGTG GACACAGTTT TGGTTTTGGG GACTAGGGGG CCCCACTCCT 



240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
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2100 
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3060 
3120 
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169 GGTGGTACCG TTTGGACTTA CTAGGGCAGT GGGACATATA GGCCGGGGCT AGTGGGATAA 3180 

170 CGGGGAGTTA CGCCTGATGA CTTTTTTGAT GGAATCCTGC ATTAGATAGC TGGTGGGACC 324 0 

171 CCCCCCTCAG AATTGGGGAA CTGAGGAGAC TCCAGGGAGG GTGTCCTTCC AGGGAGAGCA 3 300 

172 GCTATGAGGG GCCCCCTAGC TTCCTGTGCC TGGAAGTAAG AGAACCAGTA AAGGGCCATA 3 360 

173 CACACCTGTA CCCAAGAGAC CGCTCTCCAT TTGCTTTCTT TTTTTACTAA ATAATTGTAA 3420 

174 AATATTATTA TGACATAAAG AACCATTTAA GGCCAAAAAA AAAA 34 64 
176 (2) INFORMATION FOR SEQ ID NO: 3: 
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(i) 


SEQUENCE CHARACTERISTICS: 
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i TYPE: < 


amino acid 
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i STRANDEDNESS : i 


single 
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i TOPOLOGY: 


linear 
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IMMEDIATE SOURCE: 




















185 






(A) 


t LIBRARY: BEPINOT01 
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(B) 


CLONE: 


2056178 




















188 




(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO 


: 3: 
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Met 


Tyr 


Ser 


Ala 


Gin 


Arg 


Phe 


Trp 


Gly 


Thr 


He 


Trp 


Ala 


Arg 


Arg 


Gly 


191 


1 








5 










10 










15 




192 


Ala 


His 


Leu 


Ala 


Pro 


Pro 


Asp 


Ala 


Ser 


He 


Leu 


He 


Ser 


Asn 


Val 


Cys 


193 








20 










25 










30 






194 


Ser 


He 


Gly 


Asp 


His 


Val 


Ala 


Gin 


Glu 


Leu 


Phe 


Gin 


Gly 


Ser 


Asp 


Leu 


195 






35 










40 










45 








196 


Gly 


Met 


Ala 


Glu 


Glu 


Ala 


Glu 


Arg 


Pro 


Gly 


Glu 


Lys 


Ala 


Gly 


Gin 


His 


197 




50 










55 










60 










198 


Ser 


Pro 


Leu 


Arg 


Glu 


Glu 


His 


Val 


Thr 


Cys 


Val 


Gin 


Ser 


He 


Leu 


Asp 


199 


65 










70 










75 










80 


200 


Glu 


Phe 


Leu 


Gin 


Thr 


Tyr 


Gly 


Ser 


Leu 


He 


Pro 


Leu 


Ser 


Thr 


Asp 


Glu 


201 










85 










90 










95 




202 


Val 


Val 


Glu 


Lys 


Leu 


Glu 


Asp 


He 


Phe 


Gin 


Gin 


Glu 


Phe 


Ser 


Thr 


Pro 


203 








100 










105 










110 






204 


Ser 


Arg 


Lys 


Gly 


Leu 


Val 


Leu 


Gin 


Leu 


He 


Gin 


Ser 


Tyr 


Gin 


Arg 


Met 


205 






115 










120 










125 
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Gly 


Asn 


Ala 


Met 


Val 


Arg 


Gly 
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Arg 


Val 


Ala 


Tyr 


Lys 


Arg 


His 


207 
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135 










140 










208 


Val 
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Thr 


Met 


Asp 


Asp 


Leu 


Gly 


Thr 


Leu 


Tyr 


Gly 
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Asn 


Trp 


Leu 


209 


145 










150 
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210 
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Asp 


Gin 


Val 


Met 


Asn 


Met 


Tyr 


Gly 


Asp 


Leu 


Val 


Met 


Asp 


Thr 


Val 


211 
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170 
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Pro 


Glu 


Lys 


Val 


His 


Phe 


Phe 


Asn 


Ser 


Phe 


Phe 


Tyr 


Asp 


Lys 


Leu 


Arg 


213 








180 










185 
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214 


Thr 


Lys 


Gly 


Tyr 


Asp 


Gly 


Val 
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Arg 


Trp 


Thr 
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Val 


Asp 


He 


215 
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Lys 
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Leu 
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He 
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He 


His 


Leu 


Glu 


Val 


His 
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Leu 


He 
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Val 


Asp 


Val 


Arg 


Arg 


Arg 


Thr 


He 


Thr 


Tyr 


Phe 


Asp 


219 


225 










230 










235 
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220 


Ser 


Gin 


Arg 


Thr 
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Asn 


Arg 
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He 


Ala 


Lys 


Tyr 
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223 260 265 270 

224 Trp Lys Gly Tyr Phe Lys Met Asn Val Ala Arg Gin Asn Asn Asp Ser 

225 275 280 285 

226 Asp Cys Gly Ala Phe. Val Leu Gin Tyr Cys Lys His Leu Ala Leu Ser 

227 290 295 300 

22 8 Gin Pro Phe Ser Phe Thr Gin Gin Asp Met Pro Lys Leu Arg Arg Gin 

229 305 310 315 320 

230 He Tyr Lys Glu Leu Cys His Cys Lys Leu Thr Val 

231 325 330 
233 (2) INFORMATION FOR SEQ ID NO : 4: 

23 5 (i) SEQUENCE CHARACTERISTICS: 

236 (A) LENGTH: 1991 base pairs 

237 (B) TYPE: nucleic acid 

238 (C) STRANDEDNESS : single 

239 (D) TOPOLOGY: linear 

241 (vii) IMMEDIATE SOURCE: 

242 (A) LIBRARY: BEPINOT01 

243 (B) CLONE: 2056178 

24 5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 

247 GCCTCCCTGT CCCCCGACCC TCTTTTGATG CCTCAGCAAG TGAAGAGGAG GAAGAAGAGG 60 

248 AGGAGGAGGA GGATGAAGAT GAAGAGGAGG AAGTGGCAGC TTGGAGGCTG CCCCCAAGAT 120 
24 9 GGAGTCAGCT GGGAACCTCC CAGCGGCCCC GCCCTTCCCG CCCCACTCAT CGAAAAACCT 180 

250 GCTCACAGCG CCGCCGCCGA GCCATGAGAG CCTTCCGGAT GCTGCTCTAC TCAAAAAGCA 240 

251 CCTCGCTGAC ATTCCACTGG AAGCTTTGGG GGCGCCACCG GGGCCGGCGG CGGGGCCTCG 300 

252 CACACCCCAA GAACCATCTT TCACCCCAGC AAGGGGGTGC GACGCCACAG GTGCCATCCC 3 60 

253 CCTGTTGTCG TTTTGACTCC CCCCGGGGGC CACCTCCACC CCGGCTGGGT CTGCTAGGTG 4 20 

254 CTCTCATGGC TGAGGATGGG GTGAGAGGGT CTCCACCAGT GCCCTCTGGG CCCCCCATGG 4 80 

255 AGGAAGATGG ACTCAGGTGG ACTCCAAAGT CTCCTCTGGA CCCTGACTCG GGCCTCCTTT 540 

256 CATGTACTCT GCCCAACGGT TTTGGGGGAC AATCTGGGCC AGAAGGGGAG CGCACTTGGC 600 
2 57 ACCCCCTGAT GCCAGCATCC TCATCAGCAA TGTGTGCAGC ATCGGGGACC ATGTGGCCCA 660 

258 GGAGCTTTTT CAGGGCTCAG ATTTGGGCAT GGCAGAAGAG GCAGAGAGGC CTGGGGAGAA 720 

259 AGCCGGCCAG CACAGCCCCC TGCGAGAGGA GCATGTGACC TGCGTACAGA GCATCTTGGA 780 

260 CGAATTCCTT CAAACGTATG GCAGCCTCAT ACCCCTCAGC ACTGATGAGG TAG TAG AG AA 84 0 
2 61 GCTGGAGGAC ATTTTCCAGC AGGAGTTTTC CACCCCTTCC AGGAAGGGCC TGGTGTTGCA 900 
2 62 GCTGATCCAG TCTTACCAGC GGATGCCAGG CAATGCCATG GTGAGGGGCT TCCGAGTGGC 960 

263 TTATAAGCGG CACGTGCTGA CCATGGATGA CTTGGGGACC TTGTATGGAC AGAACTGGCT 1020 

264 CAATGACCAG GTGATGAACA TGTATGGAGA CCTGGTCATG GACACAGTCC CTGAAAAGGT 1080 

265 GCATTTCTTC AATAGTTTCT TCTATGATAA ACTCCGTACC AAGGGTTATG ATGGGGTGAA 114 0 

266 AAGGTGGACC AAAAACGTGG ACATCTTCAA TAAGGAGCTA CTGCTAATCC CCATCCACCT 1200 

267 GGAGGTGCAT TGGTCCCTCA TCTCTGTTGA TGTGAGGCGA CGCACCATCA CCTATTTTGA 1260 

268 CTCGCAGCGT ACCCTAAACC GCCGCTGCCC TAAGCATATT GCCAAGTATC TACAGGCAGA 1320 

269 GGCGGTAAAG AAAGACCGAC TGGATTTCCA CCAGGGCTGG AAAGGTTACT TCAAAATGAA 13 80 

270 TGTGGCCAGG CAGAATAATG ACAGTGACTG TGGTGCTTTT GTGTTGCAGT ACTGCAAGCA 1440 

271 TCTGGCCCTG TCTCAGCCAT TCAGCTTCAC CCAGCAGGAC ATGCCCAAAC TTCGTCGGCA 1500 

272 GATCTACAAG GAGCTGTGTC ACTGCAAACT CACTGTGTGA GCCTCGTACC CCAGACCCCA 1560 

273 AGCCCATAAA TGGGAAGGGA GACATGGGAG TCCCTTCCCA AGAAACTCCA GTTCCTTTCC 1620 

274 TCTCTTGCCT CTTCCCACTC ACTTCCCTTT GGTTTTTCAT ATTTAAATGT TTCAATTTCT 1680 

275 GTATTTTTTT TTCTTTGAGA GAATACTTGT TGATTTCTGA TGTGCAGGGG GTGGCTACAG 1740 

276 AAAAGCCCCT TTCTTCCTCT GTTTGCAGGG GAGTGTGGCC CTGTGGCCTG GGTGGAGCAG 1800 
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